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Abstract 


This report iS ia comparison of Language C and Pascal from several 
aspects, includitg histcry, language features, suitability for system 
programming, suitability for structured programming, and implementa- 
bility. 
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1. INTRODUCTION 


Pascal and = are in the same class of languages as PL/I, FORTRAN, 
ALGOL 60, ALGOL 68, and COBOL. That is, they are intended to be 
procedural and sompilable languages. This contrasts with the more 
specialized langiages such as APL (for arrays), SNOBOL (for strings), 
and LISP (for lists), which are extremely difficult or impossible to 
compile. This also contrasts with non-procedural languages such as 
RPG, CSMP or GPSS. It is assumed that the reader of this paper knows 
cne or more of the languages in the first group, and has some under- 
Standing of data structures, pointers, and recursion. This paper will 
primarily discuss the aspects of the two languages that make them 
unique, especiaity the things that make them easy or troublesome to 
use. 


1.1. THE BACKGROUND OF EASCAL 


Pascal was developed ina European programming language “tradi- 
tion", and shows its ALGOL heritage. Niklaus Wirth developed it 
primarily as a teaching tool (4). It has become a popular language in 
universities on pany machines. It also has influenced the design of 
many subsequent Languages, including the final four candidates for the 
proposed Departnent of Defense "Tronman" standard programming 
language. 

When miniconputers were the smallest computers around, the most 
popular language on them was BASIC. If another language besides 
assembler was us2d, FORTRAN was the usual candidate. With microproces- 
sors BASIC is still the beginner's language, but now PASCAL is often 
considered the b2st candidate for a larger language. 

The American National Standards Committee X3 has subcommittee 
X3J9 to prepare a proposal for a standard Pascal. So far, the 
proposed standara (6) is only a slight change from the Jensen and 
Wirth book (4) waich has been the de facto standard. Standardization 
of needed extensions to Pascal seems to be further off in time. 


1.2. THE BACKGROUND OF C 


C also comes originally from European influences, and was origi- 
nally based on a typeless language called BCPL, which was intended to 
be a simple and iighly rortable system programming language. BCPL was 
distributed wideiy when it first came out. At Bell Labs, a variant of 
BCPL was createl, called B, which was still a typeless language, and 
some experimental minicomputer code, the beginnings of the UNIX 
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operating system, were done in B (UNIX is a trademark of Bell Labora- 
tories). C was designed as a typed language based on B. UNIX was 
subsequently written almost completely in C. Since UNIX has become 
popular, especially in universities, the C language has become known 
in that fashion. C was intended primarily to be a system programming 
language, but aas become a language usable for general purpose 
programming. It has been ported to several machines, mostly by Bell 
Labs efforts. Some non-Bell compilers are available for various 
machines. It is not nearly as well known as Pascal. There is no user 
group specificaliy for Language C, although there is a UNIX user group 
that may have sone of that function. 


1.3. AN OVERVIEW OF PASCAL 


Pascal is a relatively small language which has some _ block 
structure, recursive functions and procedures, arrays, pointers, and 
data structures, andthe ability to do simple input/output. Unusual 
features, comparzd with PI/I or older languages, are the ability to 
define new data types, and to specify variables that have limited 
subranges of scalar or integer values. An unusual data type is the 
power set of a tinite range of scalar values; it turns out to bea 
very useful equizalent cf bit strings. | 

All declarations must appear before any executable statements 
that refer to the declared items. The form of the language is such 
that the compilec can be a one pass compiler without much difficulty. 
The language has some block structure in the ALGOL or PL/I sense, and 
only has the equivalent of PL/I automatic storage and a kind of based 
storage, called “heap" storage. All items are required to be 
completely declaz:ed. This implies that if a pointer is declared, then 
the type of the value it points to must also be declared. There are no 
defaults in the language. Like ALGOL, the standard language implies 
that separate coapilations are not easy to provide, aithough there are 
extensions ta sone Pascal compilers that allow this. 

Basic data types include characters, integers, "reals" (float- 
ing), Boolean, ssalars, and subranges. Structured data types include 
arrays, records (Similar to PL/I structures), pointers, sets, variant 
records (a sort df primitive ALGOL 68 union), and files. | 

Pascal has poth assignment and procedure call for simple control 
structures. Comsosite control structures include BEGIN...END (which 
does not denote a block), FOR, WHILE and REPEAT loop statements, and 
IF...THEN...ELSE and CASE for alteration statements. There is a 
primitive GOTO statement. 

Pascal has the expected relational and arithmetic operators, and 
some built in arithmetic functions. Except for conversion from integer 
to floating, aii conversions must be explicitly written. Pointer 
generation and iereferencing is explicitly done, similarly to PL/I. 
There are no array or structure operations except component selection 
and assignment. 7 

Input/output is done with built-in functions and procedures. 
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1.4. AN OVERVIEW OF C 


C allows s2parate compilation, and so interconnection of func- 
tions is related to whether they are in the same file when compiled, 
and whether funztion names are declared external or not. Functions 
cannot be nestei within functions (C has no procedures), although 
block structure ioes exist. Thus C has an overall program structure 
rather like that of FCRIRAN with named common. Unlike FORTRAN, C 
allows recursion, and has the equivalent of automatic, static, and 
based storage. 

Basic data types include character, two sizes of integer, two 
sizes of float, unsigned integer, and bit field. Composite types 
include array, structure, union, and pointer. C recently has hada 
type definition facility added to the language, although for a long 
time it did not have it. Boolean is not separate from integer data 
types in principle. 

The basic sontrol structure of the language 1s the expression; 
assignment iS an expression. If a function is called without using its 
returned value, the value is simply discarded or ignored and is not 
considered an ercor. Thus functions can be used as procedures. 

Composite control structures include a grouping statement equiva- 
lent tc the PL/I BEGIN;...END; and loop statements such aS a rather 
general FOR, and a WHILE. There is a conventional IF statement, anda 
primitive SWITCH statement that is a kind of computed GOTO which looks 
rather like a CASE statement. There is a simple GOTO statement, and 
statements for repeating or ending a loop from within the middle of 
the locp. 

There are aore oferators in C than in Pascal, and automatic 
conversion betwen all basic data types is done, except for pointer. 
Automatic generation of fointers or dereferencing of pointers is done 
in many circumstinces, unlike Pascal. There are no structure or array 
operators other than component selection. 

Input/foutput is not part of the language, and is done typically 
with library routines. 

C has a sSiaple preprocessor style macro language that provides 
file inclusion capabilities, abbreviations, and alternative selection 
of code based upon programmer defined parameters. 


2. DETAILED LANGUAGE COFPARISONS 


In this section we will examine in some detail the significant 
features of C and Pascal, stressing where the two languages are 
Signiticantly ditferent frem each other or from more familiar program- 
ming languages. Note that both languages were influenced by avail- 
ability to their designers of 96 character ASCII terminals. Both use 
square brackets and curly brackets, for example. 
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2-1. THE OVERALL LANGUAGE STRUCTURE 


Unextended Pascal effectively insists on the complete program 
being compiled all at once, whereas C can have groups of functions 
compiled separately. The order of writing program parts is’ much 
Stricter in Pascal than in C. The Pascal program order is as follows: 


PROGRAM name (files used by program) 
LABEL list of labels; 
CONST constant declarations such as PI = 3.141592 
TYPE type daclarations such as 
FLOAT = REAL; 

COLOK = (RED, ELUE, GREEN, YELLOW); 
VAR all variable declarations except those local 

to prozedures or functions declared below 
All proceduce and function declarations 
BEGIN 

the body of the main program 
END. 


The LABEL, =ONST, TYPE and VAR sections are omitted if there is 
nothing tc declare in them. Built-in functions and procedures of 
Pascal do not bave to be declared in order to be used. A function or 
procedure has the same form as that shown by the program, except that 
the END has a s2micolon instead of a period, and instead of PROGRAM, 
the keywords FUNZ TION or PROCEDURE are used, along with declaration of 
the types of arjyuments, and the value returned if it is a function. 
Within any function or procedure, further functions or procedures can 
be declared. 

The CONST declaration provides a way of naming constants. LABELs 
must be declared before they appear in the text. TYPE declarations can 
be viewed as an extension mechanism, or more Simply as an abbreviation 
mechanism for deslarations. 

The structuce of a C program is niet freer than that of Pascal. 
It is loosely of the following forn: | 


declarations 
function 
declarations 
function 
etc. 


Essentially, deciarations outside of functions are normally equivalent 
to PL/I EXTERNAL declarations. Also all functions are externally known 
when the program is loaded with other separately compiled programs. It 
1s not possible to nest functions within functions, unlike Pascal. 
This means there can be a name conflict problem. On the other hand, 
the grouping statement (denoted by curly brackets), can be a block 
anywhere, with its own declarations of automatic variables, just as in 
PL/I, and unlike Pascal. Essentially all variables must be declared 
before they are referenced. 
A C function has the following forn: 
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hame(argument list, if any) 
argument deslarations, if any 


declarations and statements, if any 


} 


Note that tae looser structure of C allows the easy inclusion of 
routines frcm other sources, such as libraries of functions. This is 
not so easy for Pascal, where it may be necessary to split apart the 
global declarations from the functions. Combination of C programs is 
also aided by th> ability to compile them separately, unlike Pascal. 


2.2. THE DATA SIKUCTURES 


Under the heading of data structures, we will briefly discuss 
basic types, sush as float and integer, and structures, which are 
combined from basic types. Then storage classes will be discussed. 
Operations that san be performed on specific types will be discussed 
in following sections. 


222.1. BASIC TYPES 


In Pascal some attempt was made to define the language indepen- 
dently of specitic machine word sizes. The approach was not that of 
PL/AL, where th2 arithmetic precision rules are machine independent. 
Instead, the proyrammer has available a standard constant MAXINT that 
tells him the iateger precision, so that he can write programs in 
terms of that, and achieve machine independence if it is important. 
There is only on? precision of integer, although the declaration of a 
variable as beiag ina  subrange will allow the compiler to compile 
shorter precision, where the compiler writer sees fit. There is no 
comparable precision information about real arithmetic. 

In Pascal, sharacters are recognized as being machine dependent. 
There is a function ORD(C) that returns an integer indicating the 
relative order oi the character argument in the character set. CHR(TI) 
accepts an inteyzr, and returns the character that is the Ith charac- 
ter of the charaster set. With care, it is possible to write programs 
that are independent of the character collating sequence of the 
specific inplemeatation of Pascal. Pascal apparently does not promise 
to implement any particular minimum character set as values of charac- 
ter variables. Note that both incC and Pascal, a character variable 
only hclds one saaracter at a time, and to have strings of characters, 
character arrays must be used. 

In contrast to Pascal, C is more machine iependent, since its 
types are derivel from the natural operand sizes and types available 
on the PDP-11. They also happen to correspond well to those on the IBM 
370 or the Seri2s/1. They include two sizes of integer, SHORT and 
LONG, and INT is the usual declaration if the choice between the sizes 
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is left up to tae compiler. All that is promised is that size(SHORT) 
<= size(INT) <= size (LONG). The size of INT must be at least the size 
of an address. lhere is also an unsigned integer which is the same 
size as INT. Cnaracters are variables that hold one character, and 
otherwise can be treated as integers. Note that Pascal allows compar- 
ison of arrays of characters (if of the same size) in a single opera- 
tion, whereas C does not. There are two sizes of floating point, 
FLOAT and CCUBLE. 

C has no Bodlean data type, and uses the distinction between zero 
and non-zero for this, for all basic data types except floating point. 
Pascal has Boolean predefined as a scalar data type of two elements, 
TYPE BOOLEAN=(FauSE, TRUE) and ali operators for scalar data types are 
usable on type Boolean. | 
| Pascal has two data types which are not in C. Any Scalar type can 
be defined by enumeration of its elements. For example, we could 
declare a new data type called COLOR as’ follows: TYPE COLOR=(YELLOW, 
RED, ORANGE, GREEN, BLUF, PURPLE), and then declare variables of that 
data type. The identifiers inthe parentheses are then used as 
constants of that data type. The order of these constants 1s known to 
the relational operators, so that RED<ORANGE is true. A variable of 
type COLOR would hold orly one color at a time. 

Pascal allows the definition of subranges of any data type (that 
is, integer or scalar) that has successor and predecessor functions. 
Thus we could declare 

VAR Lip alg Ke Ow 99°: 

ROOM, WALL: RED .. BLUE; Fi 
I, J and K tak2 on only subranges of integer values, and could in 
principle be stored ina byte in the IBM 370. Since the data type 
COLOR has only six possible values, it could be represented by integer 
values 0 to 5 of 1to € in the implementation of the data type. The 
variables ROOM aod WALI would take on only four of the six possible 
colors, and in principle their values could be stored in 2 bits. | 

In C there is a data type called a bit field, which has limited 
usage. BaSicaliy it is unaligned unsigned integer data which is not 
allowed to overiap integer sizes or boundaries. They are subparts of 
integers, and wave few cther operations than assignment or value 
accessing defined for them. They allow machine dependent accessing of 
bits. | 


202.2. COMFLEX wy PES 


Both languages have arrays, and use square brackets to designate 
subscripts. In both cases, strictly speaking, arrays are one dimen- 
Sional, but any data type, simple or complex, may be components, 
including arrays. Thus an array of arrays is equivalent to a single 
two dimensional array. Thus in an array of arrays of integers, the 
expression A{2][3] will access the element at row 2 and column 3, if 
it is viewed in the conventional manner. In Pascal, this may _ be 
abbreviated as A[2,3]. | 

C may not pass arrays or structures as function arguments, or 
return them as values, whereas Pascal can. On the other hand, both 
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languages can pasS oor return pointers to arrays or structures. It is 
possible tc leav2 undefined the size of arrays in C, where it must be 
declared in Pascal. This is a severe restriction in Pascal, because it 
makes it impossible to define an array handling routine that is 
independent of the actual size of the array to be manipulated. Thus 
the same routine could not invert both a 10x10 array anda 11x11 
array, unlike in PL/I Or FCRIRAN. This also causes problems in string 
manipulation, since strings are arrays of characters. Thus there are 
strong interests among Pascal users to extend the language to allow at 
least a dope vector style of handling arrays, as is done for PL/I. 

Both C and Pascal have data structures, of a power similar to 
PL/I's, although not using level numbers. For example, in Pascal we 
could define: 

TYPE COMPLEK = RECCRD REALPART, IMPART: REAL END 
where the data type (the structure) has two components of the same 
data type REAL. Accessing cf components is as for PL/I; if we declare 

VAR X COMPLEX; 
then we may acc2ss the imaginary part of the variable with X.IMPART. 
Although we show COMPLEX as a new data type, the variable X could have 
been declared instead as 

Xs RECORD REALPART, IMEFART: REAL END; 

The equivalent declaration in C is: 

struct { float realpart; float impart} x; 

As can be seen, the order of types and names in Cis’ reversed from 
that of Fascal. | | | | 

Tt is possible in both languages to define variables that can 
have one data type at one point, anda different data type at another 
point. In Pascal this is called a variant record, and in C, a union. 
In both languages, they are declared very similar to structures, and 
the "component" names are used to designate what the possible value 
types are, and access of a component implies that that type is what is 
currently stored in the variable. In C an example is declared as: 

union { int p; float r} jy; | | 
The variable y can hold either an integer, or a floating point value, 
but only one at ia time. To access it for the int value we say y.p, and 
yer accesses the value as if it were floating. Neither language checks 
that ycu are in cact using the correct accessing method for the value. 
The storage siz> and alignment is the maximum necessary to hold the 
largest and most strictly aligned value declared. 

Pascal has a complex data type, called SET, which is very useful, 
and is not found in language C. A set may be declared to be built out 
of items of a finite set of data, typically a scalar data type, ora 
Subrange. The nuaber of items cannot be more than the maximum set size 
allowed by the sompiler, which often is about 60 bits or so, for 
historical reasons. An example will be helpful: 

TYPE FRUIT = (APPLE, ORANGE, BANANA, GRAPE); 

VAR FRUITBASKET: SET CF FRUIT; 

Subsequently we aight set the variable FRUITBASKET to a particular set 
of the scalar vartues declared above: 

FRUITEASKET == { ORANGE, GRAPE ]; 

After the assignnent, the set will contain the two values, ORANGE and 
GRAPE, but not tae other two possible values. In practice, a set is 
represented by one bit for each possible type of thing that can be 
stored in it. fae value of scalar type FRUIT can be stored in two 
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bits, if we declared variables of that type. It has four possible 
values. A set of that type would require four bits, one for each 
possible type of fruit that might be stored in the set variable. 
Again, although the space savings is possible in principle, most 
Pascal compilers implement only one size of set. - 

Pointers in both languages must have declared the type of value 
to which they point. Thus a pointer to an integer is not the same data 
type aS a pointer to an array of characters. The checking of the 
pointer tyre is auch stricter in Pascal than in C, but both are much 
stricter than wnat can be checked in PL/I. In Pascal, a pointer may 
be created only ay calling a built-in function NEW, which does roughly 
the equivalent ot PL/I ALLOC. That is, it calls a routine like GETMAIN 
to supply the storage to be pointed to. This type of storage is 
called "heap" storage. In cC, this is possible, but it is also possi- 
ble tc ccempute a pointer to a variable (or component of a variable) of 
any storage class. This corresponds to the ADDR function of PL/I. 
Thus C is much aodre flexible than Pascal in this respect. In effect, 
in Pascal fpointec variables can only refer to based storage (in FPL/I 
terms), whereas in C, pcinters can be to any class of storage. 

In Pascal, iunctiors and procedures may have any type of argument 
value or variaple passed to them, and may return any type except 
function or proc:dure names, whereas in C only simple types or poin- 
ters to any typ2 can be passed or returned as values. Pointers to 
functions may also be passed or returned in C. 


2.2.3. STORAGE CLASSES 


Pascal has only one storage class, which is the eguivalent of 
PL/I automatic scorage, for non-pointer variables. For pointer varia- 
bles, there is only one class, which is roughly like based storage of 
PL/I, gotten and freed under program control. | 

By contrast, C has several more storages classes. It has the 
equivalent of automatic storage, which as usuai is acquired at block 
entry and freed at block exit. In both languages, a stack is a natural 
place for guch storage. cC also has static storage, in the PL/I sense, 
which may be eather known outside the compilation, or within the 
compilaticn only, or just within a single function. If Known outside 
the compilation, it is of course like PL/I EXTERNAL variables. Basic 
data type variables can also be declared storage class REGISTER, 
although there 15 no promise that the compiler will in fact keep the 
variable in a machine register. Such a declaration can be taken as 
advice to the compiler code optimizer. C also can provide the equiva- 
lent of based storage via its pointers, but is not restricted in what 
kind of storage a pointer variable can point to. 
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2.2.4. INITIALIZATION OF VARIABLES 


Pascal has 30 means of deSignating initial values of variables. 
This means that initialization must be done by assignment statements 
in the program body or a separate function for that purpose. By 
contrast, C alloss initialization specifiers ffor static, external and 
local (automatic): variakles. For local variables, this means that the 
variable declaration is not split apart from the setting of its 
initial value, even though the code generation may be the same as for 
Pascal. This 125 an aid to documentation. For globally accessed 
variables, which can be static incC, the advantage is that no assign- 
ment statements are generated to be executed at run time; in Pascal 
run-time code necessary for initialization. 


2.3. THE CCNTROL STRUCTURES 


Both languayesS can group sequences of statements together, so 
that they can be used as if they area single statement. This is 
important becaus2 most of the complex control structures are in terms 
of single statements as components, e.g. IF expression THFN statement 
ELSE statement. The grouping statement of Pascal is BEGIN ... END. 
That of C is {...}. The semicolon is uSed as a statement separator in 
Pascal (like ALGIJL and unlike PL/I). The semicolon is used to end a 
statement in  C, unless the "statement" is really a statement group. 
This is almost but not guite the PL/I convention. 


2.3.1. LCOE STATEMENTS 


The lcop statements are relatively similar in the two languages. 
The FOR statements are intended for initialization and iteration of a 
variable that can be used within the body of the FOR. Pascal cnly 
allows stepping ap or dcwn by a value of 1, and does not consider the 
iteration value after the end of the loop to be defined, which 1s a 
limitation on the user. C allows any initializer, any stepping 
statement, and any test fcr completion of the loop, and thus provides 
more generality. 

Bcth languayes have means for looping, with a test either at the 
beginning or at the ending of the loop, uSing WHILE or UNTIL keywords. 

C has a meaas of ending the execution of the body of a loop from 
in its middle. The BREAK statement causes controi to go after the 
loop. CONTINUE (a0t a well chosen keyword) causes the next iteration 
of the statement to begin, without executing the rest of the body of 
the loop. Pascal must get these effects by GOTO, or else by having 
the remainder ot the locp in an IF statement. 
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2.3.2. ALTERNATE SELECTION STATEMENTS 


Both statemazants have alternate selection statements of the IF 
form, With an optional FLSE. Both have a form of CASE statement, 
although Pascal's presents less difficulties to the user. An example 
of the Pascal focm is: 

CASE expression OF 

Li: statement; 
12: statement; 
L3, L4, L5:s statement; 
16: statement; 

END 
The “labels" L1, L2, etc., are really any constant of the data type of 
the expression. Unfortunately, there 1S no means for specifying what 
to do for values of the expression that do not have a label with that 
value. This meaas that all possible values that can occur must be 
enumerated explicitly. 

The form ot case statement in C is: 

Switch (ex pression) 

{ | 

 caseéel: statements 
break; 

caseé2: statements 
break; 


default: statements 


The expression must be integer valued, and only integer constants must 
be individual cases. If kreak is not used to jump out of the loop at 
the end of the statements handling a case, then execution will contin- 
ue in the statements for the next case! However, at least C provides a 
way (the defauit case) of not having to mention all possible cases 
that can occur. 


2-4. THE OPERATORS 


Both larnguajes have the expected collection of arithmetic and 
compariscn operators. Both have the ability to build complex "Boolean" 
expressions witi AND, C8, and NOT style operators, although their 
notations differ. Beycnd this, Pascal has some set operators, and 
some standard arithmetic functions, such as for cosines or logarithms. 

The Pascal set oferators include union, intersection and set 
difference, and comparison (Boolean valued) operators to test set 
inclusion, and s2t membership. For example, if there were two varia- 
bles SA and SB ot type SET OF FRUIT, defined earlier, we could write: 

IF APELE IN (SA + SP) THEN statement 
to test if APPL# is in either or both sets. | 

Language C was designed to generate code without having to 
compile subroutine calls to implement operations. Thus it does not 
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provide trigonometric and similar functions as a part of the language, 
although obviously they can be made easily available in libraries. The 
point is that sach functions are not standardized as a part of the 
language. Aside from the compilability restriction, and the lack of 
sets and operators on them, C has considerably more operators than 
Pascal. For aritametic cperators, there are left and right shift, and 
the Bcolean operators really do bitwise "and, "or", "not", and “exclu- 
Sive or" operations. Additionally, there are two more Boolean opera- 
tions, which return only O or 1 instead of doing bitwise “and" and 
"or" operations. They prcmise to test their first operand, and if the 
final cperator r2sult is known, the second operand is not computed. By 
contrast, in Pascal (and many other languages) it is undefined whether 
the second operand is computed or not, and therefore it is sometimes 
not safe to write some apparently natural expressions. For example, 
given that m 1S an array with subscripts ranging from 0 to 10: 

if ( (O<=i) && (i<=10) && (m{i]>20) ) statement 
is a safe statement to write, since the access of array m would only 
be done if the sabscript iis valid. The equivalent in Pascal could 
only be done Sarely with nested IF statements: 

IF (O<=I) AND (I<=10) THEN 

IF M[I]>20 THEN statement 
If the C programzer used & (bitwise and) instead of &&, it would be an 
unsafe statement to execute, for the same reason it is in Pascal. 

C has many increment and decrement operators. For example, the 
unary operators #+x and --x increment the value of x before presenting 
the value to tae rest of the expression they are in. xt+ and x-- 
increment or deccement the value in x after delivering the value. 

C haS many assignrent operations, and they all can be used as 
expressions. A simple asSignment is of the form variable = expres- 
sion. 

variatle =+ expressicn 
is equivalent to: 

variable = variable + (expression) 

This abbreviated form is available for most binary operators. 

C has available what was called a conditional expression in ALGOL 
60. It is effectively an IF statement that returns one of two values: 

Xx = ( a>bdb? as: b ) 

In the example, the larger of the values of a or bis stored in x. 
That particular sonditicnal expression is equivalent to PL/I MAX(A,B). 

Another ilmportant difference between Pascal and C is that C 
allows arithmetis to be performed on pointers in a limited way. I.e. 
additicn and suatraction can be done with the forms pti, p-i, p-p, 
++p, -~p, ptt, p--, p=+t 1, and p=- i, where p represents any pointer, 
and 1 any integer value. Normally the pointer is declared to point to 
an array of some data type. When the pointer is stepped by one, it is 
really made to pdint to the next item in the array. If it is an array 
of integers, wWaich might be four bytes in size, the real operation 
might be incrementing an address by four. When taking the difference 
of two pointers, they must point to the same data type. If the 
integer differens:2 is added to the second pointer, then the first 
pointer would be the result. Note that there is no check that you run 
off the end of the array in either direction, although in principle 
that check might be possible with dope vector style information. 
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2.5. DATA CONVERSIONS 


Essentially all ccnversions from one type to another in Pascal 
are done with explicit functions, except for conversion from integer 
to real. This contrasts dramatically with C, where all non-complex 
data types are freely converted from one to another, except for 
pointers. C is rather like PL/I in its freedom of conversions, 
although the basic types are all essentially arithmetic in nature. 
Aside from such conversicns, which tend to go from smaller to larger, 
and from integer to floating, when mixed, C also does referencing and 
dereferencing in a manner rather like ALGOL 68. This may be illus- 
trated by two examples. If an array iS written as an argument toa 
functicn, since arrays are not passed aS arguments, C assumes you 
meant to pass a pointer to the array (which is legal) and generates 
the pointer for you. Similarly if a pointer B is declared to point to 
an array, Since sabscrifting of pointers is meaningless, you may write 
B{ 5], and the compiler assumes that you wanted to follow the pointer 
to the array ani access its 5th component. Both of these constructs 
would be illegaz in Pascal. 


3. FUNCTICNS AND PROCEDURES 


The distinction between functions and procedures is that the 
latter does not ceturn a value. Pascal has both forms of subroutines, 
and cC only has functicns. Since C allows statements to be simply 
function calls, and then ignores any value returned since the call is 
not part of a larger expression, there is no important loss of capa- 
bility in C by having nc procedures. 

C only alloss basic types and pointers to be passed as arguments 
(no arrays or structures). Pointers may be passed which point to 
anything. There is a Similar restriction on values returned. The 
arguments are aiways passed by value, i.e. their value is always 
copied to the stack, as if the parameter variables are simply local 
variables cf the called program. In fact, the values may be changed in 
these parameter variables without affecting the original variable. [In 
order for a funstion tc be able to modify something that the caller 
passes to it, th2 pointer to that thing must be passed as an argument, 
and of course tiie function should declare the parameter as a pointer 
to the appropriate type. | 

In Pascal, rhe conventions are more like that of PL/I, i.e. there 
is a distinction between functions and procedures, and there ig a 
distinction between arguments passed by reference or by value. Parame- 
ters explicitly declared VAR must be variables on the calling side, 
and can have their value modified by the function or procedure called. 
Parameters not declared VAR can be any any expression, whose value is 
copied when pass2d to the function or procedure. Pascal can pass any 
data type to a ctunction or procedure, and can return any type except 
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functicn and pros2dure names. 

Pascal and = differ considerably in the strictness of their type 
checking fcr funstions and procedures. In essence, Pascal will strict- 
ly check at compile time the correspondence of argument and parameter, 
and the value returned with the declared type to be returned. 
Although ycu deciare parameter types in C, Since it is necessary for 
the use of thos2> parameter values within the function, no check is 
made against those jefinitions when compiling calls. There is no check 
to see that the tunction is returning the correct type either. Thus it 
is both easy to make mistakes, and easy to "cheat" deliberately, to 
get machine dependent effects (treating a pointer as an integer, for 
example). In tae UNIX system, although the C compiler does not do 
these sorts of checks, there is a program called LINT which will do 
so, when given a series of files of C programs that are intended to 
run together. 

C functions are much more flexible than Pascal functions or 
procedures in one sense. It iS possible to define any variable as 
having a data type with one of its dimensions (of array) as unknown in 
size. An example is a character array of indefinite length. It is the 
program's responsibility +o not go beyond the actual length of the 
array, and Since there is no dope vector information, the program must 
have scme way ot determining the actual length. For character strings, 
the usual convention in UNIX is to end the string with a null charac- 
ter (hex (©). Another convention might be to pass the length asa 
separate argument. Pascal has a much stricter control over its 
arguments and raturned values, aS mentioned above. In effect, it is 
not possible to pass twe different size arrays to a routine, and have 
it adjust to tn2 size. As aresult of this strictness, there are 
efforts to extend Pascal to allow this. One possibility is to add 
appropriate dope vector information for arrays, as is done for PL/I. 


G4. INPCT/OUTPUT 


Input/foutput is not specified as a part of the C language, since 
the designers telt that it was not appropriate. AS ae result, 
input/output is done Ly libraries of subroutines. In UNIX these 
routines are written in C, which is coded in machine dependent ways if 
necessary. 7 

Files are a data type in Pascal, but many of the usual operations 
valid for any otiaer data type are not valid for files. Instead, file 
operations are done fy ktuilt-in functions (whose argument rules 
violate the type checking restrictions imposed on user written rout- 
ines). Any partazsular file can be viewed as a Seguential file, a kind 
of array which can be processed only from front to back, with a window 
locking at only one comperent of the erray et a time. The wirdow is 
essentially a pointer to a buffer for the file. -Ilithough there are 
many cases of iaput/foutput where this suffices, it is not general 
enough for alt types ot files, e.g. variable lengtk record files or 
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random access files. 


5. EVALUATION OF THE LANGUAGES 


The two sanguages are evaluated from several viewpoints in 
following subsections, and the conclusion section has a brief summary 
of significant differences or tradeoffs taken by the two languages. 


5.1. LANGUAGE SizE AND EASE CF CCMPILATION 


It is clear that C is a larger language, mostly because it has 
more simple data types, and more operators on them, and more automatic 
conversions between them. Both languages were designed to be simple to 
compile. Fascal compilers often are based upon methods of compilaticn 
designed tc make the ccmpiler easily portable. Many compile to an 
intermediate lanyuage usually called P-code, which may then be inter- 
preted, compiliei into machine code, or perhaps processed a macro 
assembler. P-cole is not necessarily the best intermediate language 
for all machines. C conpilers normally generate machine code. Some C 
compilers have b2en designed to be portable. , 


5.2. LIBRARIES 


Since it is easy tc combine programs from several sources into a 
Single file int, and in Pascal it may be necessary to split arart 
such programs tod combine them with others, C is clearly superior to 
Pascal on this point. However, since C has no nesting of functions, 
and insists that all function names are external, there can be some 
name clashes, waich can cause difficulties. An extension of Pascal 
could be made that retains its type checking, but allows both nesting 
of functicns and a C-like freedom of ordering of items. Such = an 
extension would probably require giving up the ability to compile 
Pascal in cre pass. | 

The C preprocessor has the ability to include source files within 
a program teing compiled. Pascal does not have this capability. The 
preprocesscr also supplies a simple macro and abbreviation facility. 
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5-3. STRUCTURED £2 OGRAMFING 


For structured prcegramming, the languages can be compared on 
several points. One is the library issue already mentioned. 

In comparison of statement types, Pascal is ahead of C for the 
case statement, Since it allows a cleaner way of stating alternate 
cases. However, Pascal reeds a default statement in its CASE. 

It is cleac that the lack of nesting of functions inC iSa 
problem, and that Pascal is superior in this area. However, Pascal 
does not have any blocks other than whole functions or procedures, 
whereas C allows any grouping statement to be a block with its cwn 
local variables. Thus variables used only within a limited area could 
be declared in that area, and would exist (on the stack) only when 
that area is being executed. 

It is clear that the type definition facility of Pascal and C are 
very useful features fcr hiding details (encapsulation) of new data 
types as needed. With the C define facility, it is also possible to 
define macros t> encapsulate sequences of code that are generated 
in-line. In Pascal this can only be done by subroutine calls. IncC, 
for example, we may define an in-line MAX operation by: 

#define max(A, B) ((A) > (BE) ? (A) =: (B)) 

Then tke line: 
Gg = max(m-1,20) * j;3 
would expand as: 
q = ((m-1) > (20) ? (m-1) = (20)) * 53 

Although th2 macro facility is useful, it is not a substitute for 
the ability to iefine new operators, such aS is available in more 
recent languages like CLU. 


5e4. EASE CF LEAcNING 


It seems clear that on many counts, Pascal 1S much easier toc 
learn. C has many more tricky points, andas in APL or PL/I, almost 
any expression zaaS a meaning. Pascal will catch you on "strange" 
combinations of »dperaticns. Also the order of declarations is obscure 
in C, compared with Pascal, as the following illustration shows. 

INT * * ( * QQSV() ) CJ 3 

€ 5 4 Z 1 3 

QOSV is the variable, and the numbers shown below the C declaration 
illustrate the order of declaration. QQSV is a (1) function returning 
a (2) pointer that points to (3) an array of indefinite size whose 
components are (4) pointers to (5) pointers to (6) integers. In Pascal 
the order is lice that cf FL/I, from variable name to type, from 
structure to component cf structure, from pointer to thing pointed to, 
all from left to right. The only thing that can be said for C is that 
the declaration order tends to be written in the same pattern as when 
writing expressions to access the variable. E.g. * (**QQOSV(X))[ 39 ] 
would end up with an integer. This ability to build up many operators 
in a single statement, which can have several assignments, has the 
same lack of clarity as API "one-liners", 
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The Pascal equivalent of the above function declaration would be: 

FUNCTION QUSV(P: INT): @ ARRAY[O0..40] OF @ ® INTEGER; 

BEGIN : 

| function body 

END; 
The Pascal expression equivalent to the C expression is: 
QQOSV (X)@[30 ]0d0 (Note we have used "0" instead of the Pascal up-arrow.) 
The order of acs:ss is left to right, which is also the order of 
declaration. The conclusion is that Pascal is clearer both = for 
declaration and asage ir this language area. | 


525. SYSTEM FROGEAMMING 


It is clear that C was designed for system programming, and its 
data types reflect the machine it matured on, the PDP-11. With a few 
hardware dependeacies, gotten mostly by what Pascal would consider 
type violations, C can be used for almost all system programming 
Situations. It has no provisions for execution of privileged instruc- 
tions, which must be written in another language. 7 

Pascal was not designed for system programming, but it has heen 
extended or modiiied tc frovide such languages as Concurrent Pascal, 
and MODULA, whith are more suitable for system programming. They do 
not allow you to get as close to the machine as C, and are extensions 
in the direction of parallel processes, multitasking, etc. 

C was explicitly designed to not address the problem of multi- 
tasking, parallei programing, process synchronization, etc. These 
things are carri2d out ky functions called by the programs. This makes 
the C programs simpler, although some overall assumptions about the 
nature of tasks in UNIX, and also about how stacks are implemented, 
have combined to make this a reasonable choice. This approach has 
allowed all except about 10C0 lines of code of UNIX to be written in 
Ce , | 7 | 
Pascal is often translated to an intermediate language called 
P-code, which is the rather limited instruction set of a hypothetical 
stack oriented machine, which is then interpreted, or compiled into. 
some real machin? instruction set. There is no reason in principle why 
Pascal cannot b= compiled efficiently, which is one necessity for 
system programmnug. For example, a set should not take up more room 
than needed, but usually most compilers do not try to optimize such a 
thing. : 

The main rastricticns on Pascal, compared with C, for system 
programming, are the lack of external and static variables. Also it 
would help to be able to point to other than "based" (heap) storage. 
Pascal can pass Function and procedure nameS as parameters, but cannot 
store pointers t> functions in variables. C (and PL/I, for example) 
can do this. Tnis facility allows one program to dynamically load 
another, or store which function is associated with some resource 
Without having to compile those functions together. The ability to 
have dynamically specified dimensions for arrays is also necessary. It 
would ke very us2ful for structured programming to relax the order of 
declarations of items, e.g. intermixing constants, variables, and 
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types. It would still be reasonable to insist that an item must be 
declared before ase. | 

It is not cartain that Pascal can reasonably take advantage cf a 
machine that has several precisicns of arithmetic. There is some hope 
for integers, since one can declare an integer to have a subrange, and 
if it can be declared tc have a value in the range <-127..127, the 
compiler conceivably cculd allocate the integer in an 8-bit byte. 
However, there does not seem to be a way of using two floating pcint 
hardware precisions in the current language, and it may pay to extend 
it to allow declaration of two precisions of floating point. It is 
very useful to aave separate compilations of Pascal, which do not 
require later stinking that make the modules appear as if they were 
compiled tcgethec. Tt is clear that the pointer arithmetic of C is 
useful and powerful. It is a potential candidate for Pascal exten- 
sions. 


5-6. APPLICATION PROGRAMMING 


The fact that C has been used for many applications in the UNIX 
system shows that it can be a good language for this purpose. Part of 
this facility fot programming comes from the UNIX system itself, and 
perhaps much mote from the fact that libraries can be separately 
developed (such as for input/output), and then shared among users. 
Pascal does not provide this in most implementations. 

By contrast, Pascal is probably easier to use due to its cleaner 
language design, as long as its restrictions (lack of dynamic arrays, 
for example) do x0t get in the way. It certainly seems to be a more 
readable languag2 and therefore is a better candidate for application 
programming, wWita some minor extensions. 


6. LANGUAGE CHANGES 


In this section will be summarized the language changes that are 
recommended for 2eascal. It is felt that in the long run Pascal has a 
wider audience, anda bigger potential for use in a wide range of 
areas. This 1s because C has several liabilities that indicate it has 
already grcwn about as far as it can go. The C liabilities are: (1) 
expression syntax that is too complex, when taking into account 
implicit referensing and dereferencing and conversions, (2) a defec- 
tive case (SWITCh) statement, and (3) many ways of escaping the type 
Checking mechanisms such that unsuspected mismatches might not be 
easily detected. (4) It is all too easy to make mistakes in writing 
Operators, andend up with a different operator. For example, a 
common bug in C 1s writing IF (A = 1)... when it is intended to 
compare A with 1. But "=" 4s assignment, and "==""is comparison for 
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equality. This aistake is not caught by the compiler since assignment 
is an expression and the statement is therefore legal. The value of A 
in the example would ke tested for 0 value, and since it would be 
non-zero, it would be ccnsidered true. This sort of deficiency cannot 
be corrected without making drastic incompatible changes to CC. By 
contrast, extensions tc Fascal could be upwards compatible with the 
standard Pascal. 

The extensions recommended for Rascal include (1) external 
variables and szparate ccmpilation of functions and procedures, (2) 
the additicns ot blocks with their own static or local variables, (3) 
STATIC storage types, (4) the ability to point to more types of 
Storage, (5) the ability to store function and procedure names in 
variables, ( 6) “dynamic arrays" in the sense that their size need not 
be known by a funtction cr frocedure until the array is passed to it as 
an argument, (7): declaration of initialization of variables, (8) a 
default case for the CASE statement, and grouping of cases by 
subrange, (9) a method cf specifying precision of floating and integer 
variables such taat various precisions of real hardware can be easily 
taken advantage of, (10) the ability to declare things ina more 
flexible crder, so that functions written elsewhere or on library 
files can be aacluded more easily, (11) the means of ending the 
current loop iteration cr leaving the loop without having to reach the 
bottom of the statement, (12) making the order of evaluation of 
operands for AND and OR explicit, and (13) possibly some pointer 
arithmetic somewaat along the lines of C, although with the ability to 
check range violations at run time. 

Besides the above language extensions, I would want the compiler 
to generate reasonably efficient code, and with the option of omitting 
run time ckecking for certain things, in order to do a minor amount of 
system or machine dependent operating system coding. It would be 
necessary to generate code that would be reentrant, if that is possi- 
ble for the target machine, and to have the ability to have multiple 
modules and multiple prcgram stacks (one per task) in handling multi- 
ple tasks. Unfortunately some Pascal run time Support facilities grab 
all of free storage for their stack and heap storage. 

Several extansions and modifications to Pascal exist such. and 
MODULA and Concucrent Pascal. Some data abstraction languages such as 
CLU or EUCLID nave been influenced by Pascal. ADA (7), the newly 
proposed DOD standard, was influenced by Pascal and its descendants, 
and contains aii of the improvements to Pascal that were suggested 
above. ADA is a larger language than C or Pascal, to ‘judicious 
extensions to Pascal seem still worthwhile. It is perhaps too soon in 
the development of ADA to consider a subset of that language instead 
of an extension to Pascal. 
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7. CONCLUSIONS 


We can sSumaarize the differences between the two languages as 
follows: Pascal has fewer basic data types, and checks them more 
strictly. Fascais sets are better than C bit fields. Pascal does better 
type checking, ard as a result in C you can "cheat" more easily, or 
make inadvertent and undetected mistakes. For overall program struc- 
ture, C is more convenient and flexible, except that it does not allow 
nesting of functions, and insists on making them all external. C is 
Superior in pointer arithmetic, haS more conversions, but because it 
does it implicitiy in many circumstances, subtle mistakes can remain 
undetected for a long time. Pascal is safer in this regard. Pascal's 
Structured statements are less general, but also safer. C has more 
storage types, some of which are necessary for certain types of 
programming. 

In summary, Pascal is cleaner and easier to use without making 
subtle mistakes, but is smaller and more restrictive. For the languag- 
es as they curreatly exist, C has more power and is better for system 
programming and possibly fcr general purpose programming. But with 
extension of relatively simple sorts, Pascal would be better and 
cleaner for most purposes. I would recommend that Pascal extensions be 
made rather than using a standard C, mostly for subsequent maintaina- 
bility and readagility cf programs. 
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